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Amendments to the Claims 

This listing of claims will replace all prior versions, and listings of claims in the 
application. 

Claims 1-7. (Cancelled) 

Claim 8. (Previously presented) A superscalar microprocessor for processing 
instructions, the microprocessor comprising: 

an instruction fetch unit configured to fetch instructions from an instruction store 
according to a sequential program order; 

a branch prediction circuit configured to provide a branch bias signal indicating 
whether a conditional branch controlled by a conditional branch instruction is predicted 
to be taken or not taken; 

an instruction buffer coupled to receive fetched instructions from the instruction 
fetch unit and configured to buffer a plurality of fetched instructions, including an 
instruction selected according to the branch bias signal; 

a plurality of functional units configured to execute instructions, thereby 
generating result data; 

a register file including a plurality of entries configured to store data including 
result data generated by the plurality of functional units, wherein each of the plurality of 
entries is accessible by reference to a respective location in the register file; 
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a resource identifying circuit configured to concurrently identify execution 

resources for a plurality of buffered instructions, thereby making a plurality of 

instructions concurrently available for issue, wherein the identified execution resources 

for each of the available instructions includes a functional unit capable of executing the 

instruction; 

a register rename circuit configured to provide references to locations in the 
register file for logical register references included with the plurality of buffered 
instructions; 

an issue control circuit coupled to the resource identifying circuit and configured 
to concurrently issue more than one of the available instructions to the functional units 
for execution, based on availability of the identified execution resources for each 
instruction and availability of respective operands for each instruction in the referenced 
locations in the register file, without regard to the sequential program order; 

a plurality of data routing paths coupled between the plurality of functional units 
and the register file and configured to concurrently transfer result data from more than 
one of the plurality of functional units to the register file; and 

bypass control logic coupled to the plurality of data routing paths and configured 
to distribute result data from a first one of the plurality of functional units as operand 
data for another one or more of the plurality of functional units via an alternate data path 
that bypasses the register file, wherein distributing result data via the alternate data path 
occurs concurrently with transferring result data to the register file. 

Claim 9. (Previously presented) The microprocessor of claim 8, wherein: 
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the plurality of functional units includes an integer functional unit and a floating- 
point functional unit; and 

the bypass control logic is further configured such that an integer result from the 
integer functional unit is distributed to the floating-point functional unit via the alternate 
data path. 

Claim 10. (Previously presented) The microprocessor of claim 8, wherein: 
the plurality of functional units includes an integer functional unit and a floating- 
point functional unit; and 

the bypass control logic is further configured such that a floating-point result 
from the floating-point functional unit is distributed to the integer functional unit via the 
alternate data path. 

Claim 1 1 . (Previously presented) The microprocessor of claim 8, further 
comprising: 

operand data routing paths coupled between the register file and the functional 
units and configured to concurrently transfer operand data to more than one of the 
functional units. 

Claim 12. (Previously presented) The microprocessor of claim 11, wherein the 
operand data routing paths transfer operand data directly from the register file to the 
functional units. 
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Claim 13. (Cancelled) 



Claim 14. (Previously presented) A method for processing instructions in a 
superscalar microprocessor, the method comprising: 

fetching instructions from an instruction store according to a sequential program 

order; 

predicting whether a conditional branch controlled by a conditional branch 
instruction included in the fetched instructions is taken or not taken; 

buffering a plurality of fetched instructions, including an instruction selected 
according to the prediction, in an instruction buffer; 

concurrently identifying execution resources for more than one of a plurality of 
buffered instructions, the identified execution resources for each of the more than one of 
the plurality of buffered instructions including a functional unit capable of executing the 
instruction; 

providing references to locations in a register file for logical register references 
included with the plurality of buffered instructions, wherein the register file includes a 
plurality of entries, each of the plurality of entries being accessible by reference to a 
respective location in the register file; 

concurrently making available for execution a plurality of instructions for which 
execution resources are identified and register file location references are provided; 

concurrently issuing more than one of the plurality of available instructions for 

execution by a plurality of functional units, based on availability of the identified 

execution resources for each available instruction and availability of respective operands 
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for each instruction in the referenced locations in the register file, without regard to the 

sequential program order; 

executing the issued instructions in the plurality of functional units, thereby 

generating result data; 

transferring the result data from the functional units to a the register file; and 
concurrently with said act of transferring, distributing the result data from a first 

one of the plurality of functional units as operand data for another one or more of the 

plurality of functional units via a bypass data path that bypasses the register file. 

15. (Previously presented) The method of claim 14, wherein: 

the plurality of functional units includes an integer functional unit and a floating 
point functional unit; and 

the act of distributing the result data includes distributing result data from the 
integer functional unit to the floating point functional unit via the bypass data path. 

16. (Previously presented) The method of claim 14, wherein: 

the plurality of functional units includes an integer functional unit and a floating 
point functional unit; and 

the act of distributing the result data includes distributing result data from the 
floating point functional unit to the integer functional unit via the bypass data path. 

17. (Previously presented) The method of claim 14, further comprising: 
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concurrently transferring operand data from the register file to more than one of 

the functional units via a plurality of operand data routing paths. 



18. (Previously presented) The method of claim 17, wherein the operand data 
routing paths transfer operand data directly from the register file to the functional units. 



19. (Cancelled) 



Claim 20. (Previously presented) The microprocessor of claim 8, further 
comprising retirement control logic coupled to the register file and configured to 
concurrently retire a plurality of instructions according to the sequential program order. 

Claim 21. (Previously presented) The microprocessor of claim 20, wherein the 
register file includes: 

a temporary buffer having a first plurality of entries; and 

a retired register array having a second plurality of entries; 

and wherein the retirement control logic is further configured such that when an 
instruction is retired, corresponding result data is transferred from the temporary buffer 
to the retired register array. 

Claim 22. (Previously presented) The method of claim 14, further comprising 
retiring instructions according to the sequential program order. 
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Claim 23. (Previously presented) The method of claim 22, wherein the register 

file includes: 

a temporary buffer having a first plurality of entries; and 
a retired register array having a second plurality of entries; 
and wherein the act of retiring an instruction includes transferring corresponding 
result data from the temporary buffer to the retired register array. 

Claim 24. (Previously presented) The microprocessor of claim 8, wherein the 
resource identifying circuit is further configured to concurrently identify execution 
resources for a first one and a second one of the plurality of buffered instructions, 
wherein the second one of the instructions has a data dependency on the first one of the 
instructions. 

Claim 25. (Previously presented) The microprocessor of claim 8, wherein the 
register rename circuit is further configured to concurrently provide references to 
locations in the register file for a first and a second one of the buffered instructions, 
wherein the second one of the buffered instructions has a data dependency on the first 
one of the buffered instructions. 

Claim 26. (Previously presented) The microprocessor of claim 8, wherein the 
logical register references include at least one of a source register reference and a 
destination register reference. 
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Claim 27. (Previously presented) The method of claim 14, wherein concurrently 

identifying execution resources includes concurrently identifying execution resources for 

a first one and a second one of the plurality of buffered instructions, wherein the second 

one of the instructions has a data dependency on the first one of the instructions. 

Claim 28. (Previously presented) The method of claim 14, wherein providing 
references to locations in the register file includes concurrently providing references to 
locations in the register file for at least a first one and a second one of the buffered 
instructions, wherein the second one of the buffered instructions has a data dependency 
on the first one of the buffered instructions. 

Claim 29. (Previously presented) The method of claim 14, wherein the logical 
register references include at least one of a source register reference and a destination 
register reference. 

Claim 30. (New) A method of executing instructions using a microprocessor, 
the method comprising: 

pre-fetching an instruction group including a plurality of instructions from a 
memory in a first processor cycle and holding the instruction group in a pre- fetch buffer, 
the pre-fetching accomplished so that instruction groups can be returned out of program 
order and subsequently reordered; 
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transferring the instruction group held in the pre- fetch buffer to a multiple-stage 
buffer when there is a vacancy in the multiple-stage buffer with sufficient capacity to 
handle the instruction group as a unit; 

simultaneously decoding, in a processor cycle after the first processor cycle, a 
plurality of instructions that are included in the instruction group, the decoding 
performed with at least one instruction at a predetermined position in multiple-stage 
buffer; 

checking, in a processor cycle after the decoding, a dependency relation between 
plural decoded instructions at least on the basis of registers to be used by the plural 
decoded instructions; 

allocating instructions, for instructions judged to have no restriction on execution 
due to dependency, to a plurality of functional units so that at least one instruction 
executes outside the program order; 

executing, in a processor cycle after decoding, the allocated instructions using the 
plurality of functional units; and 

removing the instruction group held in the multiple-stage buffer from the buffer 
and advancing an other instruction group held in the multiple-stage buffer so that another 
instruction from the other instruction group moves to the predetermined position. 

Claim 31. (New) The method of claim 30, wherein the predetermined position 

of the multiple-stage buffer is defined by a decoder that identifies a plurality of 

instructions for simultaneous transfer, the predetermined position including at least a 

final stage of the multiple-stage buffer, the final stage holding a plurality of instructions. 
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Claim 32. (New) The method of claim 30, wherein a processor cycle in which 
the plurality of instructions included in the instruction group are held in the multiple- 
stage buffer is a second processor cycle that follows the first processor cycle. 

Claim 33. (New) The method of claim 3 1 , wherein decoding the plurality of 
instructions begins in a second processor cycle following the first processor cycle. 

Claim 34. (New) The method of claim 33, wherein the checking the dependency 
relation and the allocating instructions to the plurality of functional units begins in a third 
processor cycle following the second processor cycle. 

Claim 35. (New) The method of claim 30, wherein the microprocessor reads 
data from an external memory through a bus. 

Claim 36. (New) A data processing apparatus comprising a superscalar type 
microprocessor having a plurality of functional units that can execute instructions 
simultaneously, the microprocessor comprising: 

a pre-fetch unit that pre- fetches an instruction group including a plurality of 
instructions from a memory, the plurality of instructions having a predetermined 
program order; 

a buffer that buffers a plurality of instruction groups pre-fetched by the pre-fetch 

unit; 
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a decoder that decodes a plurality of instructions provided by the buffer, the 
plurality of instructions including at least one instruction group that has its instructions 
decoded simultaneously; 

a register file including a register array having a plurality of registers used by at 
least one functional unit in executing decoded instructions; 

a dependency check unit that checks for a dependency relation between the 
plurality of instructions provided by the decoder, on the basis of which registers are used 
by the plurality of instructions; and 

an instruction unit that allocates an instruction to a functional unit so that the 
instruction executes outside the predetermined program order, the instruction judged by 
the dependency check unit not to be subject to restriction due to a dependency; 

wherein when a plurality of instructions of the instruction group are all retired, an 
entry in the buffer corresponding to the instruction group is released. 

Claim 37. (New) The data processing apparatus of claim 36, further comprising 
a memory from which the microprocessor accesses data through a system memory bus to 
allow execution of the instruction group. 

Claim 38. (New) A data processing apparatus comprising a super scalar type 
microprocessor having a plurality of functional units that can execute instructions 
simultaneously, the microprocessor comprising: 
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a pre- fetch unit that pre-fetches a plurality of instructions from a memory in 
preparation for execution by one or more functional units, the plurality of instructions 
having a predetermined program order; 

a buffer that holds a plurality of instructions, including one or more instructions 
pre- fetched by the pre-fetch unit; 

a decoder that simultaneously decodes a plurality of instructions from 
instructions held in the buffer; 

a register file including a plurality of registers used in the one or more functional 
units executing the plurality of decoded instructions; 

a dependency check unit that checks for a dependency relation between the 
plurality of instructions output from the decoder, on the basis of use conditions stored in 
a register; 

an instruction unit that allocates an instruction to a functional unit so that the 
instruction executes outside the predetermined program order after the instruction is 
judged by the dependency check unit not to be subject to restriction due to a dependency; 
and 

a retirement unit that specifies a register in which to store a result of executing 
the instruction outside the predetermined program order, wherein the retirement unit 
retires the instruction in program order after the instruction is completed. 

Claim 39. (New) The data processing apparatus of claim 38, wherein 
the register file has a temporary buffer that stores results from execution of 
instructions outside the predetermined program order, and 
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when executing instructions outside the predetermined program order, the 

microprocessor uses the temporary buffer, and when completing the instructions 

executed outside the predetermined program order, contents of the temporary buffer are 

written in a corresponding register to retire the instructions. 

Claim 40. (New) The data processing apparatus of claim 38, further comprising 
a memory from which the microprocessor accesses data through a system memory bus to 
allow execution of the instruction group. 

Claim 41. (New) A data processing apparatus comprising a superscalar type 
microprocessor having a plurality of functional units that can execute instructions 
simultaneously, the microprocessor comprising: 

a pre-fetch unit that pre-fetches an instruction group including a plurality of 
instructions from a memory in preparation for executing the plurality of instructions in 
the functional units; 

a buffer having plural stage registers that can transfer stored data in a forward 
direction and that stores the plurality of instructions of the pre- fetched instruction group; 

a decoder that simultaneously decodes a plurality of instructions held in at least a 
final stage register of the buffer; 

an allocating circuit that allocates the decoded plurality of instructions to the 
plurality of functional units for execution, at least one instruction executed outside a 
predetermined program order; and 
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an instruction completion unit that advances contents of a plurality of registers of 

the buffer in the forward direction by a number of stages that correspond to a number of 

groups of completed instructions. 



Claim 42. (New) The data processing apparatus of claim 41, further comprising 
a memory from which the microprocessor accesses data over a system memory bus to 
facilitate execution of the instruction group. 
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